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This review of literature on progress monitoring was designed to examine the full array of curriculum- 
based measures (CBMs) in mathematics for students from preschool to secondary schools. We orga- 
nized the article around two primary concerns: the approach used to develop the measures (curriculum 
sampling or robust indicators) and the type of research necessary to establish the viability of the tasks. 
Our review addressed the technical adequacy of the measures as indicators of performance and prog- 
ress, as well as teachers’ use of the measures to improve achievement. The largest number of studies 
has been conducted at the elementary level, with less work in early mathematics or at the secondary 
level. In general, the measures have acceptable levels of reliability; the criterion validity of mathe- 
matics CBMs appears to be lower than that for reading CBMs. One important finding that emerged 
was the relatively low degree of consensus on the best approach to use in developing mathematics 
CBMs. We discuss probable reasons for this, along with implications for practice and research. 


Continuous monitoring of individual student academic prog- 
ress has long been an important aim within the field of spe- 
cial education (Deno, 1985; Fuchs, 2004). Curriculum-based 
measurement (CBM) represents an empirically supported sys- 
tem of progress monitoring that has produced demonstrated 
effects on student achievement, particularly in reading (Fuchs, 
Deno, & Mirkin, 1984; Fuchs, Fuchs, & Hamlett, 1989b; 
Jones & Krause, 1988). In recent years, as general educators 
and policymakers have emphasized greater accountability for 
schools’ efforts to teach all children, many researchers and 
practitioners have asserted the utility of progress monitor- 
ing — particularly in reading — for increasing numbers of stu- 
dents, regardless of disability status (Deno, 2003). Given 
growing attention to student achievement in mathematics, the 
practice of continuous progress monitoring should be simi- 
larly scaled up in this content domain. To do so, education 
professionals need to take stock of the empirical evidence for 
mathematics progress monitoring measures with regard to 
technical adequacy and instructional utility. 

The purpose of this article is to review the existing em- 
pirical literature on mathematics progress monitoring mea- 
sures. As we began our work on the mathematics strand of the 
Research Institute on Progress Monitoring (RIPM), we sought 
to identify the areas of greatest need in which to focus our re- 
search. A review of the literature provided the best means of 
evaluating the current status of research in mathematics CBMs. 
We have organized our article using two concerns identified 
by Fuchs (2004): (a) strategies for developing CBM tasks and 


(b) stages of research necessary to establish the viability of 
those tasks. 

Two Approaches for Developing 
CBMs in Mathematics 

Fuchs (2004) described two broad approaches for the develop- 
ment of CBM tasks. This categorization is particularly useful 
in the area of mathematics. In one approach, termed curricu- 
lum sampling , researchers have developed measures by con- 
structing representative samples of the year’s mathematics 
curriculum — taking at second grade, for instance, a larger pro- 
portion of addition and subtraction problems and, at sixth 
grade, a sampling that includes more advanced skills, such as 
division of decimals or addition of fractions. The curriculum 
sampling approach has been applied to computation, as well 
as to conceptual problems and applied mathematics skills. For 
the second approach, termed robust indicators, researchers have 
sought to identify measures that represent broadly defined 
proficiency in mathematics. Using this approach, effective 
measures are not necessarily representative of a particular cur- 
riculum, but are instead characterized by the relative strength 
of their correlations to various overall mathematics profi- 
ciency criteria. Measures in this second approach attempt to 
parallel in mathematics the kind of “robustness” that the oral 
reading CBM task offers in the area of reading: not necessar- 
ily drawn from the student’s yearly curriculum, yet offering 
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strong correlations to a host of criterion measures of overall 
subject area proficiency. In the following section, we consider 
the relative merits of each approach and their implications for 
the development of mathematics CBMs. Practitioners will need 
to weigh these advantages and disadvantages when making 
decisions about which mathematics measures to use for prog- 
ress monitoring. 

A primary advantage of the curriculum sampling ap- 
proach is the direct link it provides to the instructional cur- 
riculum, which facilitates the means to provide teachers with 
diagnostic feedback about a student’s performance with re- 
gard to specific skills or concepts. These data may assist 
teachers in designing effective remedial instruction. This di- 
rect link to the curriculum also engenders limitations associ- 
ated with this approach. Given the high level of curriculum 
specificity in mathematics, the curriculum sampling approach 
requires that different measures be developed to mirror each 
year’s mathematics curriculum. As a result, the measures model 
student growth only within a single year, not across multiple 
years of learning. Moreover, current curriculum programs im- 
plemented in the nation’s schools are quite diverse, with little 
consensus on placement of concepts in an instructional se- 
quence or relative emphasis among topics (Reys, Dingman, 
Sutter, & Teuscher, 2005). The curriculum sampling approach 
may result in measures that are wedded to a particular cur- 
riculum program, necessitating the development of multiple 
CBM systems, each linked to a specific mathematics curricu- 
lum. 

The robust indicators method offers several advantages 
from RIPM’s perspective. As the RIPM research team strives 
to create a seamless and flexible system of progress monitor- 
ing measures in mathematics, we are particularly interested in 
measures that can be used across multiple grade levels; robust 
indicators hold the promise of doing this because they are not 
linked to any particular curriculum model or instructional se- 
quence. Instead, the search for robust indicators represents an 
effort to identify aspects of core competence in mathematics. 
This measurement of core competence is designed to produce 
robust indicator data that are predictive of important outcomes 
in mathematics, regardless of the vagaries of specific cur- 
riculum programs or high stakes state tests. Moreover, the use 
of robust indicators enables students’ growth to be modeled 
over multiple years of learning. Because the robust indicators 
tap core aspects of mathematics performance, rather than a 
broad range of skills and concepts from the instructional cur- 
riculum, they are less useful to teachers in providing diag- 
nostic information about students’ strengths and weaknesses 
in particular areas. 

A Continuum of Research 
on Curriculum-Based Measures 

A three-stage continuum of CBM research described by Fuchs 
(2004) is particularly applicable to work in mathematics. As 


measures are initially developed. Stage 1 research involves ex- 
ploring the technical adequacy of their use as static (one point 
in time) indicators. This research emphasizes the reliability 
and criterion validity of the measures and the extent to which 
scores can be used to predict future performance or achieve- 
ment of important outcomes. Stage 2 research examines the 
technical characteristics of slopes generated when the mea- 
sures are used for continuous progress monitoring. Research 
at this stage examines the variability in repeated measurement 
and the extent to which slopes are reflective of student growth 
in the content area. Stage 3 research includes studies exam- 
ining the instructional utility of the measures. These applied 
studies investigate whether teachers’ use of the measures to 
inform their instructional decisions results in improved stu- 
dent achievement. 

Our review begins by describing the procedures we used 
to identify studies of CBM mathematics measures. We then 
provide a general overview of the body of Stage 1 and 2 stud- 
ies, noting approaches used to provide evidence of technical 
adequacy and suggesting guidelines for readers in evaluating 
the evidence provided for each type of measure. Following a 
summary of the technical adequacy evidence, we address the 
existing research on using mathematics CBMs to improve stu- 
dent achievement (Stage 3). We conclude the article with a 
discussion of the current state of empirical evidence for CBMs 
in mathematics and offer recommendations for both practi- 
tioners and researchers. 

Method 

The goal of RIPM is to create a seamless and flexible system 
of progress monitoring measures in reading, written expres- 
sion, and mathematics. More specifically, we are particularly 
interested in identifying or developing measures suitable for 
students of diverse ability levels across multiple grades. Our 
research team conducted a comprehensive review of the CBM 
literature in reading, written expression, and mathematics. 
The goal was to identify gaps in the technical adequacy evi- 
dence in each content area to best focus our research efforts. 

Research team members searched electronic databases 
including ERIC, Science Citation Index Expanded, Psyclnfo, 
Digital Dissertation, and the Expanded Academic Index 
using the following terms: curriculum based measurement, 
curriculum-based measurement, curriculum based measure, 
curriculum-based measure, general outcome measure, and 
progress monitoring. This process yielded 578 articles, dis- 
sertations, and reports related to CBM. Titles and abstracts 
were screened to confirm that they were related to CBM, and 
Method sections were screened to identify those that reported 
results of empirical studies of CBM (i.e., included CBM as a 
dependent or an independent variable), yielding 160 articles. 
In addition, the complete set of technical reports produced by 
the Institute for Research on Learning Disabilities (IRLD) at 
the University of Minnesota was accessed and included in the 
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review. These documents were then reviewed by a team of ed- 
ucational psychology graduate students, grouped by subject 
(reading, mathematics, spelling, and writing), and coded for 
study characteristics, such as sample size and demographics, 
type of CBM and criterion measures used, and administration 
and scoring procedures. Twenty-nine (18%) of the articles in 
this initial search addressed mathematics measures. We added 
additional papers as they were published. Our focus in this ar- 
ticle is limited to published studies and IRLD reports that ad- 
dress Fuchs’s (2004) three stages of research on mathematics 
CBMs. The one exception to our inclusion rules was a con- 
ference paper presented by Espin, Deno, Maruyama, and Cohen 
(1989) that reports results for their Basic Academic Skills 
Sample (BASS), an instrument that has been subsequently used 
in many research projects. 

This process yielded a total of 32 reports on mathemat- 
ics progress monitoring measures. We summarize the studies 
that address the technical adequacy of the measures (research 
representing Stages 1 and 2) in Table 1 according to the sam- 
ple characteristics, the type of measure, and the results re- 
ported for reliability, criterion validity, and growth. The final 
column of the table notes which stages of research are ad- 
dressed in each study. We next report studies in examining the 
effects of teachers’ use of mathematics CBMs on student 
achievement. 

Results 

Technical Adequacy of Mathematics 
CBMs for Static Measurement and 
Progress Monitoring 

The 32 identified studies included 25 documents (we refer to 
them as “studies” though some reports include multiple stud- 
ies) summarized in Table 1 that reflect an emphasis on research 
in elementary mathematics (17 studies) relative to either early 
mathematics (4 studies) or secondary mathematics (4 studies). 
A similar imbalance is evident in the relative emphasis on 
Stage 1 studies examining the use of the measures as static in- 
dicators (17 studies) in contrast to those addressing Stage 2 
(technical characteristics of slopes; 2 studies) or a combina- 
tion of Stage 1 and Stage 2 (7 studies). While early mathe- 
matics studies have relied entirely on measures developed 
using the robust indicators approach, a mixture of robust in- 
dicators and curriculum sampling measures are evident at 
both the elementary and the secondary levels. 

The reliability data include traditional approaches to 
documenting the measures’ internal consistency, test-retest, 
and alternate form reliability. Some studies (Christ, Johnson- 
Grohs, & Hintze, 2005; Hintze, Christ, & Keller, 2002) em- 
ployed generalizability and dependability coefficients. In 
evaluating the reliability data, we used the traditional bench- 
mark of .80 for typical progress monitoring applications. If 
the data were to be used for high-stakes decisions, we would 


suggest applying a more stringent criterion (above .90) for the 
measures’ reliability. 

All but one of the studies examined the validity of the 
measures by investigating concurrent and predictive criterion 
validity; Shinn and Marston (1985) supported the discriminant 
validity of the measures by comparing student performance 
across classification groups. The most common criterion mea- 
sures are individual and group achievement tests, including 
(more recently) state standards tests. Readers familiar with the 
technical adequacy data for elementary CBM reading know 
that criterion validity coefficients for oral reading fluency are 
quite high (in the .80 to .90 range). Relations between CBM 
mathematics measures and criterion variables are generally 
more modest (in the .50 to .70 range), but quite similar to co- 
efficients for commercially available achievement tests of 
mathematics (Salvia, Ysseldyke, & Bolt, 2007). 

Most studies address student growth by reporting mean 
weekly slope values representing average increases that might 
be expected in a student’s score each week. To permit com- 
parisons across studies, we converted the growth data in two 
studies (Chard et ah, 20045; Helwig & Tindal, 2002) that were 
originally reported in other formats into weekly slope rates. 
One study (Espin et al., 1989) examined cross-sectional means 
across grade levels on CBM mathematics measures. Rates of 
growth vary on the measures and appear to be influenced by 
the complexity of the responses required of students. Re- 
search in this area is too immature to offer firm evaluative cri- 
teria, but we would assert that measures on which normative 
growth is more rapid will prove to be more valuable to prac- 
titioners than those on which growth is so slow that several 
weeks’ time would be needed to see a single unit of increase 
in a student’s score. Rate of growth, of course, is only one as- 
pect of a measure’s sensitivity. The amount of variability 
around typical growth rates will also affect practitioners’ abil- 
ity to detect changes in student growth. Thus, rate of growth 
and variability must be examined when selecting progress 
measures. Research and development efforts on progress mea- 
surement must explore both scale properties and methods to 
control sources of variability that reduce the sensitivity of 
measures. 

Progress Monitoring Measures for Early Mathe- 
matics. Research on early mathematics measures has focused 
entirely on robust indicators, emphasized measures of nu- 
meracy, and been conducted exclusively with general edu- 
cation students. Two groups of researchers account for all of 
the published studies in early mathematics. Clarke and his 
colleagues (Chard et al., 2005; Clarke & Shinn, 2004) have 
examined measures such as quantity discrimination and iden- 
tifying the missing number in a counting sequence with students 
in kindergarten and first grade. In the quantity discrimination 
task, students are presented with two numbers and asked to 
name the larger number. VanDerHeyden and her colleagues 

(text continues p. 131 


TABLE 1. Technical Adequacy Studies of Mathematics Progress Monitoring Measures 
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(VanDerHeydenet al., 2004; VanDerHeyden, Witt, Naquin, & 
Noell, 2001) have conducted studies with prekindergarten 
and kindergarten students using measures such as circling 
numbers, drawing numbers, and drawing circles. Measures com- 
mon to both groups of researchers include number naming/ 
identification and counting tasks. The reliability of most of 
the early mathematics measures has been quite strong, and 
criterion validity coefficients are generally in the low to mod- 
erate range (.40 to .60) for students through kindergarten; 
stronger correlations have been obtained with first-grade stu- 
dents (above .70) using the quantity discrimination and miss- 
ing number tasks. 

The existing research in early mathematics has focused 
predominantly on Stage 1 concerns. Only two studies exam- 
ined students’ growth over time. Clarke and Shinn (2004) 
computed mean weekly growth rates on four measures across 
a 26-week period and found that an oral counting measure 
produced the highest level of mean growth. Chard et al. (2005) 
reported mean scores from fall to spring (a 32-week period) 
on three different measures. The number identification mea- 
sure produced substantially higher levels of mean growth than 
did the other two measures. All the studies in early mathe- 
matics have been published since 2000, reflecting the early 
stage of research in this area. The early mathematics measures 
with the most supporting evidence at this point include num- 
ber identification, quantity discrimination, and missing num- 
ber, though much research remains to be conducted. Future 
research should examine sensitivity of the measures to stu- 
dent growth and whether teachers ’ use of data from these mea- 
sures can improve student achievement. 

Progress Monitoring Measures for Elementary 
Mathematics. The 17 studies addressing measures for ele- 
mentary mathematics represent a variety of measures, re- 
search strategies, and student populations. Over three quarters 
of the studies have been conducted with general education 
students and address Stage 1 research issues. The majority of 
the studies have relied on curriculum sampling measures or a 
mix of curriculum sampling and robust indicators. When re- 
searchers have used curriculum sampling measures, the most 
common options included either the Monitoring Basic Skills 
Progress measures (MBSP; Fuchs, Hamlett, & Fuchs, 1998, 
1999) or a sampling of computation skills within a grade span 
(Christ et al., 2005). Given the pervasiveness of the MBSP 
measures in the literature, a more thorough description of 
these measures is appropriate. 

Two types of MBSP mathematics measures are avail- 
able: (a) Computation and (b) Concepts and Applications, 
each packaged with a software program that supports teach- 
ers’ use of the data to make instructional decisions. The Com- 
putation measure consists of 30 parallel forms at each of 
Grades 1 through 6. Fuchs et al. created the measures by se- 
lecting problem types representing a proportional sampling of 
the computation skills within the Tennessee state curriculum 
at each respective grade level. The Concepts and Applications 


measure, available for students in Grades 2 through 6, were 
developed using a similar process, but with attention to applied 
skills and concepts (i.e., reading charts and graphs, under- 
standing numbers, and solving word problems). Administra- 
tion times range from 2 to 6 min for the Computation probes 
and from 6 to 8 min for the Concepts and Applications probes. 
Two studies examined a problem-solving performance as- 
sessment that represented core skills within the grade-level in- 
structional curriculum (Fuchs et al., 2000; Fuchs et al., 2003); 
a single study relied on a sampling of word problems from 
the instructional curriculum (Jitendra, Sczesniak, & Deatline- 
Buchman, 2005). Robust indicators at the elementary level 
have focused exclusively on basic facts (e.g., Epstein, Pol- 
ioway, & Patton, 1989). The majority of these measures are 
1- or 2-min probes that sample either facts for a single oper- 
ation or a set of mixed facts representing all four operations. 

The reliability of the elementary mathematics measures 
was generally quite strong, with most coefficients well above 
.80. Exceptions to this pattern included the measures address- 
ing more complex mathematics tasks such as word-problem 
solving and problem-solving performance assessments, which 
have been less reliable. Initial studies involving basic facts 
(Tindal, Germann, & Deno, 1983; Tindal, Marston, & Deno, 
1983) reported somewhat lower reliability coefficients (par- 
ticularly for multiplication and division) than did other stud- 
ies involving single skill or mixed operation facts. Across all 
types of measures, alternate form reliability estimates tended 
to be lower than test-retest reliability coefficients were, sug- 
gesting potential variability in student performance across 
parallel forms. The generalizability study results confirmed 
this finding; multiple skill probes representing more complex 
performance required a greater number of forms to produce a 
stable estimate of student performance than did single skill 
probes (Hintze et al., 2002). 

The MBSP measures and the word-problem solving mea- 
sures developed by Jitendra et al. (2005) demonstrated the 
highest levels of criterion validity, with the majority of the co- 
efficients in the .60 to .80 range. Only two studies addressed 
the criterion validity of basic facts measures (Espin et al., 1989; 
Thurber, Shinn, & Smolkowski, 2002); correlation coefficients 
between these measures and other measures of mathematics 
computation ranged from .30 to .60. In addition to criterion 
validity, Shinn and Marston (1985) provided evidence for the 
validity of mixed computation and basic facts measures by 
demonstrating significant differences in the performance of 
three groups of students (general education. Title 1 , and special 
education) in the expected directions for all of the measures. 

Student growth on the elementary measures has most 
often been examined by computing mean weekly slope val- 
ues for students in general education or special education sam- 
ples. All but one of the studies (Espin et al., 1989) examining 
growth was conducted using the MBSP measures; Espin et al. 
examined mean scores on a basic facts task for students in 
Grades 1 to 6 and found a pattern of positively accelerating 
means across grade levels. Most of the growth research on the 
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MBSP measures has relied on general education students to 
document normative rates of growth; Shapiro, Edwards, and 
Zigmond (2005) reported mean slopes for students with dis- 
abilities. Weekly growth rates on both the MBSP Computa- 
tion and Concepts and Applications measures ranged from 
about one quarter of a point to three quarters of a point per week. 
Shapiro et al. (2005) obtained mean cross-grade estimates of 
.38 units per week on both MBSP measures. Within-grade 
mean slopes revealed more variability, ranging from 0 to .72. 

The research base for mathematics CBMs at the elemen- 
tary level is most extensive with regard to the technical ade- 
quacy of the measures as static indicators. While basic facts 
tasks have been investigated in the largest number of studies, 
only limited data have documented the criterion validity of 
these measures, and no data on typical weekly slopes are 
available to serve as a context against which to index student 
growth on these measures. The recently developed problem- 
solving measures and problem-solving performance assess- 
ments suffer from a similar lack of data; there is a clear need 
to examine the viability of these measures for documenting 
student growth. The most well established measures at the el- 
ementary level are the MBSP measures (Fuchs, Hamlett, & 
Fuchs, 1998, 1999). These measures have empirical data sup- 
porting their reliability, criterion validity, and sensitivity in de- 
tecting student growth. 

Progress Monitoring Measures for Secondary 
Mathematics. Two groups of researchers account for all of 
the published studies of mathematics progress monitoring 
measures for secondary students; research in this area has ad- 
dressed both Stage 1 (technical adequacy of static measures) 
and Stage 2 (growth). It is important to note that we did not 
identify a single study addressing mathematics CBMs for high 
school students. Foegen and her colleagues (Foegen, 2000; 
Foegen & Deno, 2001) examined a basic facts measure and 
an estimation measure, both developed using the robust indi- 
cators model. The 3-min estimation measure consists of 
40 items, with half requiring computation estimation and 
half requiring estimation of contextual (word) problems. Stu- 
dents select one of three options as the best estimate for the 
problem; the alternatives differ by a factor of 10 to encourage 
students to use estimation and number sense, rather than com- 
putation skills, as they respond to the task. Helwig and his col- 
leagues (Helwig, Anderson, & Tindal, 2002; Helwig & Tindal, 
2002) created an eighth-grade measure that reflected concep- 
tual understanding of mathematics, rather than computation 
skills. They investigated a large pool of items likely to predict 
grade-level mathematics achievement on a statewide test and 
selected a subset of items that together accounted for the 
greatest variance in student achievement. Unlike many other 
forms of CBM tasks, their probes were untimed, though they 
noted that most students completed the 15-item measures in 
less than 10 min. 

Reliability data for the middle school measures is near 
or above the .80 threshold, with the exception of one of three 


variations of an estimation task used by Foegen and Deno 
(2001). Criterion validity data have included district and state 
achievement tests, as well as teacher ratings and grade-based 
measures. The validity coefficients reported by Foegen and 
her colleagues for the middle school robust indicators ranged 
from .29 to .66, with the majority of the measures in the .40 
to .50 range. Helwig and his colleagues’(Helwig, Anderson, 
& Tindal, 2002; Helwig & Tindal, 2002) measures demon- 
strated strong criterion validity with a state mathematics stan- 
dards test, with coefficients for general education students 
exceeding .80. 

Both groups of researchers have examined student 
growth observed on the measures. Student performance on the 
concept measures developed by Helwig et al. increased over 
four administration periods at levels that were statistically 
significant, but their practical significance remains question- 
able (mean weekly growth of .04 units/week). The limited 
change in student performance may be associated with the 
small number of items (15) included in this measure. The 
measures investigated by Foegen et al. produced weekly slope 
values more comparable to those obtained for the elementary 
measures (.25 points/week for estimation and .55 for basic 
facts). 

The status of research in secondary mathematics prog- 
ress monitoring parallels that of early mathematics. All of the 
secondary research has been published since 2000; clearly, 
work in this area is also in its infancy. As with the early math- 
ematics measures, considerable research is needed to further 
establish these measures. Of the three types of potential mea- 
sures that have been investigated to date (facts, estimation, 
concepts), the concepts measures are severely limited by the 
negligible change in student performance over time. While 
these measures were tremendously successful in predicting 
student performance on criterion tests (Fuchs et al., 1994), it 
is as yet undetermined whether they can be successfully used 
as a progress monitoring measure. As discussed, research on 
these measures must be conducted to determine whether they 
are sufficiently sensitive to growth over short periods of time. 
A pressing need in the area of secondary CBM is for mea- 
sures appropriate for high school students. 

Stage 3 Studies: Teachers’ Use of CBM 
Data to Improve Student Achievement 

The 32 studies we identified included 7 that examined Stage 3 
issues of instructional utility. In identifying these studies, we 
focused on those that specifically examined factors associated 
with teachers’ use of mathematics CBM and the relation be- 
tween these practices and student achievement. As a result, 
we did not include studies in which mathematics CBM was a 
component of a larger intervention; this approach has been 
widely used in Fuchs and colleagues’ work on Peer Assisted 
Learning Strategies (PALS). Readers interested in a summary 
of research including PALS studies are referred to Stecker, 
Fuchs, and Fuchs (2005). 
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This final group of studies is summarized in Table 2. We 
have opted to present these studies chronologically to better 
capture the developmental nature of work in this area. 

Fuchs and her colleagues have conducted all seven stud- 
ies, and six of the seven investigated the MBSP Computation 
measure with students in special education. Stage 3 studies in 
mathematics have examined the viability of enhancements 
to the original CBM system, the effects of using problem- 
solving performance assessment data to drive classroom in- 
struction, and the importance of individualizing decisions 
based on CBM data. In the section that follows, we briefly re- 
view these studies. 

Two studies examined enhancements to the MBSP soft- 
ware used to support teachers’ implementation of CBM. The 
earliest study of teachers’ use of mathematics CBM (Fuchs, 
Fuchs, & Hamlett, 1989a) compared the effects of setting a 
static goal with the effects of more dynamic goal adjustment 
based on student data. Teachers in the dynamic goal condition 
were prompted by the MBSP software to raise a student’s goal 
when appropriate; teachers in the static goal condition were 
allowed to change students’ goals, but not prompted to do so. 
The achievement of students in the dynamic goal condi- 
tion was higher than that in either the static goal condition or 
a control group. Fuchs, Fuchs, Hamlett, and Stecker (1990) 
examined the addition of a skills analysis component to the 
MBSP software program: a chart representing individual 
students’ performance data on specific computation skills. 
Fuchs et al. (1990) determined that teachers who had access 
to skills analysis effected greater achievement gains in their 
students. 

Three studies investigated other means of supporting 
teacher implementation. Fuchs, Fuchs, Hamlett, and Stecker 
(1991) contrasted the addition of a computerized expert sys- 
tem to the CBM with skills analysis. Results indicated that 
students of teachers who used the expert system outperformed 
both those in the no-expert system condition and those in a 
control group. Allinder and BeckBest (1995) compared the ef- 
fects of university -based consultation with teachers’ self- 
monitoring of their CBM implementation. Although students 
in both conditions improved their achievement levels from 
pre- to posttest, no significant differences were observed be- 
tween the two conditions. Allinder, Bolling, Oats, and Gagnon 
(2000) compared CBM with and without self-monitoring to a 
contrast condition with no CBM. The self-monitoring teach- 
ers completed written questionnaires each time the software 
prompted a change in instruction. Student achievement for the 
CBM with the self-monitoring group exceeded that of the 
CBM-only group and that of a contrast group in which teach- 
ers were not using CBM. However, no differences in slope 
were obtained for the CBM groups. 

Fuchs, Fuchs, Karns, Hamlett, and Katzaroff’s (1999) 
study of teachers’ use of problem-solving performance as- 
sessment data examined achievement among three groups of 
students (those above, at, and below grade level in their math- 
ematics performance) in two conditions. Teachers of students 


in the performance assessment (PA) condition were provided 
support in administering, scoring, and adapting instruction 
based on performance assessment data; students in the con- 
trol (No PA) condition completed the pre- and posttest as- 
sessments only. Although overall achievement favored students 
in the PA condition, analyses of the data by student perfor- 
mance levels revealed that these differences were evident only 
for students at or above grade level in their mathematics per- 
formance. Students in both the PA and the No PA conditions 
who were performing below grade level had similar levels of 
posttest achievement. 

Stecker and Fuchs (2000) examined the importance of in- 
dividualizing instructional changes based on student data. In 
this study, special education teachers worked with pairs of stu- 
dents, implementing typical CBM procedures and adjusting 
instruction based on only one of the student’s data. Results of 
the study indicated that while all students demonstrated gains 
from pre- to posttest, the achievement of target students, upon 
whose CBM data instructional adjustments were made, ex- 
ceeded that of their partners receiving similar instruction. 

The Stage 3 studies conducted to date provide support 
for the conclusion that teachers can use data from the MBSP 
Computation measure to improve student achievement. Data 
from performance assessment measures were found to pro- 
duce increased achievement for students at or above grade 
level, but these same benefits were not obtained for students 
performing below grade level. It is interesting to note that an 
enhancement found to be superior to a contrast condition in 
one study (i.e., CBM with skills analysis in Fuchs, Fuchs, 
Hamlett, & Stecker, 1990) is subsequently found not to differ 
from a contrast condition when compared to a further en- 
hanced version (i.e., CBM with skills analysis and expert sys- 
tem support; Fuchs, Fuchs, Hamlett, & Stecker, 1991). These 
counterintuitive patterns in the Stage 3 research have been 
discussed by Stecker et al. (2005); further research on the dy- 
namics of expert consultation and research training are war- 
ranted. Moreover, there is a clear need to expand the breadth 
of research on teachers’ use of mathematics CBM to improve 
student achievement. To date, no studies have been conducted 
in early mathematics or middle school, nor have any of the stud- 
ies examined robust indicators. We do not yet know whether 
teachers can use data from robust indicators to improve their 
students’ mathematics achievement. 

Discussion 

Our review of empirical studies of progress monitoring mea- 
sures in mathematics revealed that the bulk of the existing re- 
search has been conducted at the elementary level, with only 
limited work in early mathematics and middle school mathe- 
matics measures. It is significant to note that we were unable 
to identify any studies addressing high school mathematics. 
We situated our review by contrasting measures developed 
using a robust indicators approach with those developed using 
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( Table 2 continued) 


135 


M) 

*3 

Ph 


5-1 JJ CD 

<d 73 Wj 

.18 3 

£ s-f 

"O 6 


c/3 OG 

<3 C 

5-1 CJ 

£ 'll 
R c 

CJ qj 


.o cd 
n w 


a> 

Q- 

F^ 


M) 

V3 C i 

c/3 •— i : 

flj 5 h 

5 h O | 

fto.ts ! 

O S r 

£ § 


Cd 

C/5 


73 

3 


Tt" 


c 

o 

U 


03 

U 


03 

U 


GO 

+ 


o 

<N 




co 

+ 


*2 
^ 03 


CD 

P 03 


Pj5 2 

3 h P5 


UUwU^uU 

ttS 


+ > 
03 03 

u u 


Q , 

Pjj & 

t-H 

ys 

< 03 
U 


bD 

C 


uS 

‘ 03 


u u 


co 

O 


ss 

D- 

£ 

o 

U 


PJ PJ 
CO CO 
— < 7T 
co <n 


W <D 
S3 cd 

.2 a 

ca <D 

f 8P 

O- 5-1 

fi (L) 

U ... II 
J3 eg — 

1 1 > 

£ §>J2 


e C 

.£ o 
O §1 

«5 

■§* 
c 


- o 
o 

3 cd Q 

< O 



o^ 

h 1 - 

O 


Cd *3, 

S3 £ 
o E 
•■JO 73 

8 3 

U C/3 
CO 

S § 

-a "a 


Kf\ C/3 

£® <d 
£ &p. 


CD 

, bD 
S3 s 
§1 

73 ^ 
cd 

.> s3 

CD O 

O 'Z3 

CD CJ 


— cd 
O 73 


>c-h cd 

2 -S 

03 
U 


co 

U 


55 I 

03 £ 

S E 

o 

O 


s g 
S E 

Oh 


« e 

« U cd 


CCS cd 


CLh *S 
PJ * 


PJ pq 
CO CO 
(N -sj- 
(N 00 


3 

PP 


■a jo 


a 6 


43 .a cd £ 
■d cd W) 5 
cd rv j3 J O- 


OJJ JS 

O- £ 02- 
cd 

tN ^ 


<N 


O 

o 

o 

(N 


06 

u 


II 

H 

U 


II 'I 

w w 
o ? 


D- C 

!5 

i eS 


S 3 


<u 06 
g £; 
cd O 


H s: 

a cs 


CQ 5 

<-> <3 




136 THE JOURNAL OF SPECIAL EDUCATION VOL. 41/NO. 2/2007 


curriculum sampling (Fuchs, 2004). Although research in early 
mathematics has drawn entirely on the robust indicators ap- 
proach, studies at the elementary and middle school levels 
have included measures representing both approaches to de- 
velopment. We did not discern clear patterns in the results fa- 
voring one approach over the other. Only three of the studies 
(Hintze et al., 2002; Shinn & Marston, 1985; Thurber et al., 
2002) provided a direct comparison of measures drawn from 
the two approaches. Results of these studies suggest that both 
types of measures can discriminate between students of vary- 
ing mathematics ability levels. Thurber et al. (2002) found that 
a simple basic facts task (robust indicator) was more strongly 
correlated with both computation and application outcome 
measures than was a curriculum-sampled computation mea- 
sure. A notable finding in this study was that the strength of 
the correlations for a CBM reading measure exceeded those 
for either CBM mathematics measure on four of the five cri- 
terion measures. Hintze et al. (2002) determined that more 
complex measures developed through curriculum sampling 
required that students complete multiple forms to reliably 
gauge mathematics performance levels. 

We also examined the nature of the research that has 
been conducted, using Fuchs’s (2004) three stages. The bulk 
of the research in CBM mathematics has comprised Stage 1 
studies investigating the technical adequacy of the measures 
as static (one point in time) indicators. Stage 2 studies inves- 
tigating the technical characteristics of student slopes derived 
from continuous progress monitoring have been conducted 
for three early mathematics measures: basic facts, the MBSP 
measures, and three types of middle school measures. The ma- 
jority of these studies have reported mean weekly slope val- 
ues, most of which are less than .5 units per week. Only two 
measures (MBSP Computation and problem-solving perfor- 
mance assessments) have been investigated in Stage 3 studies 
exploring the effects of teachers’ use of CBM data to improve 
student achievement. Both of these measures have been used 
successfully to increase student achievement over comparison 
conditions; however, patterns in the results suggest that teach- 
ers’ use of CBM likely needs to be paired with some form of 
support (consultation or self-monitoring) to produce achieve- 
ment gains that exceed those obtained in typical practice. 

Implications for Practice 

Having taken stock of the empirical support for mathematics 
progress monitoring measures, what recommendations can 
we offer to teachers? At the early mathematics and middle 
school levels, the research base is not yet sufficient to support 
strong recommendations for particular measures. The early 
mathematics measures used by Clarke, Chard, and colleagues, 
as well as the middle school measures used by Foegen and 
colleagues offer minor advantages over other measures at 
these levels. These measures have adequate initial evidence 
supporting their technical adequacy as static measures, as well 
as evidence for their use as indicators of progress. At the el- 


ementary level, the MBSP measures, basic facts, and two 
forms of problem-solving measures all have demonstrated ev- 
idence supporting their use as static indicators. For progress 
monitoring, the MBSP measures have the greatest level of em- 
pirical support. The MBSP Computation measure is the only 
measure with evidence at all three stages of research. 

In selecting a measure, teachers must also consider the 
relative advantages and limitations associated with the cur- 
riculum sampling and robust indicators approaches to devel- 
oping measures. As an example, before teachers select the 
MBSP Computation measure, they should consider the degree 
to which the curriculum represented by this measure is simi- 
lar to the instructional curriculum in their districts. Studies in 
reading CBM have examined the effects of passages drawn 
from curriculum programs with varying emphases and deter- 
mined that these differences do not diminish the technical ad- 
equacy of the oral reading fluency measure (Fuchs & Deno, 
1991). It is not clear whether this result, based on a robust in- 
dicator in reading, would also be found for a curriculum sam- 
pling measure in mathematics. We encourage practitioners 
selecting curriculum sampling measures to consider the de- 
gree of “match” between the content of the measures and that 
of their curriculum. This is particularly important in mathe- 
matics given the recent release of Curriculum Focal Points, 
by the National Council of Teachers of Mathematics (2006). 
This document, which articulates a limited number of specific 
instructional areas for each grade level (Pre-K through Grade 
8), may have substantial influence on future curricula and 
standards-based assessments, which in turn may affect crite- 
rion validity of current curriculum sampling measures. 

A final consideration for teachers is the issue of expected 
weekly growth. Existing research in mathematics CBM sug- 
gests that students do not “grow” as rapidly when assessed 
using mathematics measures as they do in reading when the 
index is oral reading fluency. Teachers who are choosing a 
measure to monitor student progress must consider the typi- 
cal rates at which students improve on the various measures 
and the importance of being able to observe small increments 
of improvement in student performance. If a measure has a 
mean weekly slope of .20, five weeks of data may need to be 
collected before teachers could expect to see a single unit in- 
crease in the student’s weekly score. For students who expe- 
rience severe difficulties in mathematics, this limited degree 
of sensitivity to growth may not meet teachers’ needs. Unfor- 
tunately, many of the mathematics measures have produced 
mean weekly slope values that are less sensitive to growth 
than teachers might desire. Furthermore, the research con- 
ducted to date is limited by small sample sizes and high lev- 
els of variability in mean slopes across grade levels, 
particularly for curriculum sampling measures for which the 
content of the measure changes with each grade level. We en- 
courage teachers to be aware of the existing research data 
when selecting among the options for progress monitoring 
measures in mathematics. We also urge researchers to attend 
to issues of sensitivity to growth as they develop measures. 
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Contrasting CBM in Mathematics 
With CBM in Reading 

The search for technically and theoretically appropriate mea- 
sures, largely resolved in CBM of reading, remains active in 
mathematics. It is both theoretically interesting and practi- 
cally important that the expanding research on progress mon- 
itoring in mathematics has not resulted in converging answers 
to the “What to measure?” or “How to measure?” questions 
posed by Deno and Fuchs (1987). Fuchs’s (2004) distinction 
between robust indicators and curriculum sampling may offer 
a potential explanation for why converging evidence for an- 
swering these questions is more difficult in mathematics than 
it is in reading. Mathematics curricula are created by sequenc- 
ing the types of mathematics competence students are to learn 
within and across grade levels. In the traditional parlance of 
curriculum developers, this is the “scope and sequence” of 
the mathematics curriculum. Different types of mathematical 
competence are addressed: geometry, measurement, and prob- 
ability, for example. Even when “problem solving” is identi- 
fied as the central focus of mathematics, extensive diversity 
exists in the types of problems that students learn to solve 
across the grades. 

Consider reading curricula in contrast to mathematics. 
From the outset, the goal of reading seems clear and relatively 
simpler than the goal in mathematics. To read, students must 
learn to translate printed text into their oral language — to 
“say and understand” the printed word. Once this initial skill 
is acquired, growth is discernible through increasing levels of 
fluency in this performance. The range of vocabulary, the 
complexity of the language structures, and the variety of back- 
ground knowledge necessary for adequate comprehension in- 
crease, yet the nature of the task remains relatively constant. 
Unlike mathematics curricula, the scope and sequence of read- 
ing curricula less address the fundamental act of reading than 
they identify embellishments on this fundamental act. This 
difference is what makes it possible to think that a child might 
“learn to read” in first grade and become quite competent by 
third grade — that in the upper grades a child is “reading to 
learn” rather than “learning to read.” Comparable ideas do not 
make sense in mathematics. It does not fit the structure of our 
language to say a child “learns to math,” and it makes no sense 
to think that students are no longer learning mathematics in 
higher grades. 

Given the differences in the curriculum design of read- 
ing and mathematics, we wonder whether the inherent char- 
acter of the mathematics curriculum militates against finding 
a robust indicator that can approximate the level of criterion 
validity that has been attained for the CBM reading progress 
measures. Perhaps it is the case that learning mathematics, un- 
like reading, requires the acquisition of increasingly differen- 
tiated types of knowledge as students move to higher levels 
of development. If the difference between developing com- 
petence in mathematics and reading is manifest in the design 
of those curricula, then we might have to develop progress 


measures consistent with those differences — that is, a robust 
indicator approach in reading and a curriculum sampling ap- 
proach in mathematics. 

Are there alternative conceptions of mathematics that 
might make a robust indicator approach possible or that might 
enable some combination of the two approaches? We have 
considered several alternatives. For example, we wonder if 
core competence in mathematics can be identified that, like 
reading, is fundamental to success in mathematical thinking 
and problem solving. Suppose, instead of identifying “math- 
ematics” as the focus, we identify “numeracy” — a concept 
comparable to literacy. Might a robust indicator of “early nu- 
meracy” be developed that is comparable to the CBM oral 
reading measure? Might growth in an indicator of numeracy 
be useful in making instructional decisions, as has been the 
case of growth in passage reading? Perhaps core competence 
exists and can be measured at different levels of development. 
Early numeracy measures might be appropriate for Grades 1 
to 3, and different types of core competence can be identified, 
and growth in those competencies might be measured in 
Grades 4 to 6, 7 to 8, and so on. Finally, like the term read- 
ing , the term problem solving seems central to mathematics. 
Clearly, the types of problems that students learn to solve 
change as they move through the curriculum. At the same 
time, it might be possible to identify a corpus of problems at 
primary, intermediate, and middle school grade levels that 
could be repeatedly sampled to index growth in problem solv- 
ing skill at those grades. Indeed, state standards tests might 
represent cultural expectations and help to define the para- 
meters of such problems. 

Implications for Research 

Research is desperately needed in several areas of mathemat- 
ics progress monitoring. A primary need is for work at the 
grade levels at which research is extremely limited or nonex- 
istent. The complete absence of any research involving high 
school mathematics represents a significant gap in our knowl- 
edge. Given increasing expectations that all students will mas- 
ter challenging curricula, teachers of students with disabilities 
and students likely to struggle in mathematics have a critical 
need for tools to support their efforts to provide effective in- 
structional programs. Rather than focusing on grade levels or 
general mathematics skills and concepts, as has been common 
in the elementary and middle school research, progress mon- 
itoring tools for high school mathematics will need to address 
particular content domains, such as algebra and geometry. Al- 
though initial work exists at the early grades and in middle 
school, a similar need exists to expand research for these pop- 
ulations. 

Extending research efforts beyond Stage 1 (technical ad- 
equacy of the measures as static indicators) is another critical 
need in the field. The pool of measures about which we have 
some evidence of their use as progress indicators is substan- 
tially smaller than is the pool of measures with evidence for 
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their use as static indicators. Furthermore, Fuchs and her col- 
leagues are the only researchers who have conducted studies 
investigating teachers’ use of progress monitoring data to im- 
prove student achievement. Only a single mathematics mea- 
sure (MBSP Computation) has evidence supporting its use to 
improve student achievement for students with disabilities. 
We anticipate that teachers can use other measures with pos- 
itive effects on student achievement, but until such research 
is conducted, this remains an empirical question. We urge 
researchers to move beyond Stage 1 investigations of mathe- 
matics measures and explore the important questions associ- 
ated with students’ growth on the measures and achievement 
outcomes. 

The empirical and practical consequences associated with 
the two approaches to developing measures have yet to be in- 
vestigated. While the robust indicator approach seems more 
suited for RIPM’s efforts to develop a seamless and flexible 
system of progress monitoring measures, the bulk of the re- 
search to date has focused on curriculum sampling measures. 
Can either or both of the approaches produce a durable mea- 
sure, useful across a range of grade levels? Is use of more than 
one measure necessary to capture mathematics proficiency? 
Might we need one type of measure to predict important fu- 
ture outcomes, but a different measure, more sensitive to small 
changes in student performance, to monitor student growth? 

These issues and questions invite an increase in research 
on CBM in mathematics. Mathematics proficiency is receiv- 
ing national attention as a means of increasing educational and 
employment outcomes for students and enhancing the quality 
and competitiveness of the U.S. workforce (Cavanagh, 2006). 
As more emphasis is placed on improving student outcomes 
in mathematics, there will be a growing demand for progress 
monitoring tools that can assess current student learning, pre- 
dict future performance, and support teachers’ efforts to de- 
sign effective instruction. To meet this need, we and our 
colleagues are pursuing mathematics progress monitoring re- 
search through RIPM to address the areas of greatest impor- 
tance. We are conducting studies of robust indicators across 
grade levels, investigating teachers’ use of early mathematics 
measures to improve student achievement, and conducting re- 
search in middle schools and in high school algebra. Students 
deserve to make meaningful and maximal progress in mathe- 
matics; educators, then, deserve a comprehensive research 
base supporting the integrity of their efforts to monitor and 
increase student growth. 

AUTHORS’ NOTE 

The Research Institute on Progress Monitoring (RIPM) at the Uni- 
versity of Minnesota is funded by the U.S. Department of Education, 
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